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APPROXIMATING (UNWEIGHTED) TREE AUGMENTATION VIA 
LIFT-AND-PROJECT, PART I: STEMLESS TAP 

JOSEPH CHERIYAN AND ZHIHAN GAO 


Abstract. In Part I, we study a special case of the unweighted Tree Augmentation Problem (TAP) 
via the Lasserre (Sum of Squares) system. In the special case, we forbid so-called stems; these are 
a particular type of subtree configuration. For stemless TAP, we prove that the integrality ratio of 
an SDP relaxation (the Lasserre tightening of an LP relaxation) is < | + e, where e > 0 can be any 
small constant. We obtain this result by designing a polynomial-time algorithm for stemless TAP 
that achieves an approximation guarantee of (| +e) relative to the SDP relaxation. The algorithm is 
combinatorial and does not solve the SDP relaxation, but our analysis relies on the SDP relaxation. 

We generalize the combinatorial analysis of integral solutions from the previous literature to 
fractional solutions by identifying some properties of fractional solutions of the Lasserre system via 
the decomposition result of Karlin, Mathieu and Nguyen (IPCO 2011). 

Also, we present an example of stemless TAP such that the approximation guarantee of | is 
tight for the algorithm. 

In Part II of this paper, we extend the methods of Part I to prove the same results relative to 
the same SDP relaxation for TAP. 


1. Introduction 

In the weighted Tree Augmentation problem we are given a connected, undirected multigraph 
G with non-negative costs on the edges, together with a spanning tree T of G. The goal is to 
find a set of edges, F C E(G)—E(T ), of minimum cost such that the multigraph (V,E(T) U F) is 
2-edge-connected. By a link we mean an element of E(G)—E(T ); thus, a link is an edge of G that 
can be used to augment T. We say that a link uw covers an edge e E E(T ) (a tree-edge) if the 
multigraph T + uw — e = (V, E(T) U {w}-{e}) is connected. * We say that a set of links F covers 
the tree T if every edge of T is covered by at least one link of F; it can be seen that F covers T 
iff the multigraph (V,E(T) U F) is 2-edge-connected. Thus, the goal is to compute a set of links of 
minimum cost that covers T. 

The weighted Tree Augmentation problem was first studied by Frederickson and Jaja in 1981 
[7]. They showed that the problem is NP-hard, and they presented a 2-approxinration algorithm. 
Subsequently, it has been proved that even the unweighted Tree Augmentation problem is APX- 
hard, see [13, Section 4]; thus the unweighted problem has no PTAS assuming P^NP. 

There have been some important advances on this problem for the corresponding unweighted 
(i.e., uniform weight) problem. Following [9], we use the abbreviation TAP for the unweighted Tree 
Augmentation problem. Nagamochi, see [10], presented the first algorithm for TAP that improved 
on the approximation guarantee of 2; the approximation guarantee is ss 1.875. Subsequently, 
Even, et ah, [9] built on the ideas and techniques initiated by Nagamochi and presented an elegant 
algorithm and analysis that achieves an approximation guarantee of 1.8. Even, et ah, explain that 
the threshold of 1.8 is a natural barrier for TAP; this comes from a particular type of subtree 
configuration, a so-called “stem” , that occurs in instances of TAP (see Section 2 for definitions). 
To improve on the approximation guarantee of 1.8, both the algorithm and analysis have to be 
refined to handle stems. This introduces significant new complications. In a conference publication 

Date: August 27, 2015. 

*In other words, a link covers every tree-edge in the unique path of T between the ends of the link. 
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from 2001, Even, et al., reported a 1.5 approximation algorithm for TAP, see the extended abstract 
[5], and recently, Kortsarz Sz Nutov [15] presented the journal version of this result. 

There are several other papers on TAP, see e.g., [1, 17, 3, 6, 8], but we do not discuss them since 
either they are not directly relevant to this paper or they are unpublished manuscripts. ^ 

Linear programming relaxations for the weighted version of TAP have been studied for many 
years. (We use the standard abbreviation LP to mean a linear programming relaxation or a linear 
programming problem.) There is an obvious “covering” LP: we have a variable x e for each edge 
e 6 E(G ) — E(T ) and we have a covering constraint for each edge of T. It is well known that the 
integrality ratio of this LP is < 2; this can be deduced from Jain’s result [11]. A lower bound of 
1.5 on the integrality ratio is known [2]; in fact, the construction for the lower bound uses uniform 
weights for the edges in E(G) — E(T), hence, the lower bound applies for TAP. 

Our results and techniques. In Part I, our main contribution is to apply the Lasserre system 
to a special case of TAP, and to derive some properties of the feasible solutions that are then used 
to analyze the integrality ratio and the approximation guarantee. A stem is a node incident to 
three edges of T that satisfies some additional conditions (see Figure 2 for an illustration, and see 
Section 2 for formal definitions). We use stemless TAP to refer to the special case of TAP where 
the instance is such that the tree T has no stem. In Part I, we focus on stemless TAP, and prove 
that the integrality ratio of an SDP relaxation (the Lasserre tightening of an LP relaxation) is 
< | + e, where e > 0 can be any small constant. 

The Lasserre system applies to an initial LP, and it derives a sequence of tightenings of the 
initial LP; these tightened relaxations are indexed by a number t = 0,1,... called the level, where 
the level 0 tightening means the initial LP. A key “decomposition theorem” (see Theorem 4.1, 
[18, 12]) asserts that a feasible solution at level t can be written as a convex combination of feasible 
solutions at a lower level such that all of these lower-level solutions y are “locally integral.” Here, 
“locally integral” means that there is a specified subset J C E(G) — E(T ), such that the solution 
y takes only zero or one values on this subset (i.e., y e 6 {0,1}, Ve G J). A key point is that 
the difference in levels (between the level t of the given feasible solution and the level of the 
lower-level “locally integral” solutions) does not depend on the size of J, rather, it depends on 
the following “combinatorial parameter” determined by J. Suppose that there exists a constant k 
such that every feasible solution x of the initial LP has < k entries in J that have value one, i.e., 
|{e G J : x e = 1}| < k for every feasible solution x of the initial LP. Then for any t > k, a feasible 
solution at level t can be written as a convex combination of “locally integral” feasible solutions at 
level (t — k). This property does not hold for other weaker Lift-and-Project systems such as the 
Lovasz-Schrijver system or the Sherali-Adams system. 

We formulate an initial LP that is a tightening of the obvious “covering” LP for TAP; see (LPo) 
in Section 3. For this purpose, we introduce the notion of overlapping pairs, and we add a family of 
constraints on overlapping pairs to our initial LP; see Proposition 3.1 in Section 3. This (together 
with the decomposition theorem) turns out to be the key for proving properties of feasible solutions 
to the Lasserre system. 

The analysis of our algorithm is based on a potential function. Our potential function is derived 
from the Lasserre tightening of the initial LP. In contrast, the previous literature uses potential 
functions that are derived from combinatorial lower bounds. We present an example showing that 
our potential function is not valid for our initial LP; in other words, the Lasserre tightening is 
essential for one part of our analysis. 


preliminary version of our paper with a weaker approximation guarantee of (1.75 + e) for TAP relative to the 
same SDP relaxation was circulated widely in July 2014, and it led to subsequent publications by others, e.g., [14], 
but we prefer to avoid discussion on these matters and we leave it to the subsequent publications. 
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Our algorithm is “combinatorial” and we do not need to solve the initial LP nor its Lasserre tight¬ 
ening to run the algorithm (but, the analysis of the algorithm relies on the Lasserre tightening). 
Our algorithm may be viewed as a variant of the algorithm of [9, Section 3.4]; see Section 6 for 
details. The algorithm is a greedy-type iterative algorithm that makes a leaves-to-root scan over 
the tree T and (incrementally) constructs a set of links F that covers T. The algorithm starts with 
F := 0, at each major step it adds one or more links to F (it never removes links from F). and 
at termination, it outputs a set of links F that covers T such that |Fj < the potential function. 
The algorithm incurs a cost of one unit for each link added to F. The key to the analysis is to 
show that for each major step, the cost incurred (i.e., one plus the number of links added to F) is 
compensated by a part of the potential function; see Section 6 for details. 

It is possible that the naive algorithm gets “stuck.” But, in this scenario, we can prove that 
there exists a small combinatorial obstruction. The algorithm can be modified for this scenario. 
The modified algorithm finds each occurrence of the small combinatorial obstruction in polynomial 
time, and then handles all of these occurrences in an appropriate way; see Section 7.2 for details; 
this part of our paper is similar to [9, Section 4.3]. 

Informally speaking, our analysis in Section 7 asserts the following: 

if the naive algorithm gets “stuck” then there exists a small combinatorial obstruc¬ 
tion, a so-called deficient 3-leaf tree, see Theorems 7.3, 7.4. 

This assertion is the key to this paper; it turns out that the algorithmic aspects as well as the 
analysis of the approximation guarantee are straightforward consequences. Our analysis in Section 
7 makes essential use of the Lasserre system and the decomposition theorem; see Figure 1 and also 
see Section 10. 

Finally, we present an example of stemless TAP such that the approximation guarantee of | is 
tight for the algorithm. 



Figure 1 . An illustration of the role of the decomposition theorem for the Lasserre 
system (Theorem 4.1) in our analysis. The analysis for Theorem 7.3 consists of 
three blocks: low-level assertions maintained by the algorithm, intermediate-level 
results (on credits and structural properties), and the high-level analysis (proof of 
Theorem 7.3); the three blocks are shown from left to right in the figure. The 
decomposition theorem pertains to the two right-most blocks. Also, we use the 
decomposition theorem to derive our potential function (Lemma 5.1). 

In Part II of this paper, we extend the methods of Part I to prove the same approximation 
guarantee for (general) TAP. Part II follows the same outline as Part I. Moreover, the initial LP 
and Lasserre tightening are the same for both parts. We mention that Part II is essentially self- 
contained and is “logically independent” of Part I; our motivation for writing Part I is to give an 
accessible presentation of the algorithmic ideas and correctness arguments used in Part II. 
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An outline of Part I is as follows. Section 2 has definitions and notation. We adopt the notation 
and terms of Even, et ah, [9], where possible; this will aid readers familiar with that paper. Section 3 
presents the initial LP, while Section 4 discusses the Lasserre tightening of the initial LP and proves 
some basic properties and inequalities. Section 5 derives our potential function, based on a solution 
y of the Lasserre tightening; this section also has an example showing that our potential function 
is not valid for the initial LP. Section 6 presents the algorithm and the “credit scheme” used by 
the algorithm. The most important component of Part I is Section 7; this section proves the key 
theorem on deficient trees (Theorem 7.3); this section also presents and proves the last piece of the 
algorithm, namely, the handling of deficient trees. Section 8 presents an example of stemless TAP 
such that the approximation guarantee of | is tight for the algorithm. The last section, Section 10, 
has conclusions. 


2. Preliminaries and notation 

This section presents definitions and notation. 

Standard notation including tree T, link set E. Let G = (V, E(G )) be a connected, undirected 
multigraph, and let T = (V, Et) be a spanning tree of G. We assume that |T| >2. By a tree-edge 
we mean an edge of T. Let E denote the edge-set E(G)—Et ; we call E the link set and we call 
an element t € E a link ; thus, a link is an edge of G that can be used to augment T. An instance 
of TAP consists of G and T. We assume that all instances of interest have feasible solutions, that 
is, we assume that (V, Et U E) is 2-edge-connected. The goal is to find a minimum-size subset F 
of E such that augmenting T by F results in a 2-edge-connected multigraph, i.e., the multigraph 
(V, Et U F) is 2-edge-connected. 

For two nodes u, v £ V, we use P UtV = P-, KU to denote the unique path of the tree T between u 
and v. 

For a node v € V, we denote the number of tree-edges incident to v by deg T (u). Let uv be a link 
such that deg r (u) = 1, deg T (u) = 1, there exists an internal node s in P U)V such that deg T (s) = 3, 
and every other node w (if any) in P U)V has deg T (in) = 2. Then, uv is called a twin link and s is 
called a stem. 



Figure 2. Illustration of a stem s and its twin link l. The tree-edges are indicated 
by solid lines and the twin link t is indicated by dashed lines. 


We use stemless TAP to refer to the special case of TAP where the instance is such that the tree 
T has no stems. 

For any U C V, we denote the set of links with both ends in U by E(U), and for any two subsets 
U, IT of V. we denote the set of links with one end in U and the other end in W by E(U, IT); thus, 
E(U, IT) := {uw G E : u € U, w € IT}. 

We say that a link uv covers a tree-edge e if P u , v B e. 

For any tree-edge e € Et, we use ^(e) to denote the set of links that cover e, thus, ^(e) = 
{uv € E : e G P u ,v}- For any node w 6 V, we use St(vj) to denote the set of links incident to w. 
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Shadows and the shadow-closed property. For two links u\V\ and U 2 V 2 , if P Ui , Vl C P u 2) „ 2 , 
then we call u\V\ a shadow (or, sublink ) of U 2 V 2 ■ In particular, if P Ul , Vl Pu 2 ,v 2 , then we call u\V\ 
a proper shadow (or, proper sublink ) of U 2 V 2 - 

For each link uv G E, if all sublinks of uv exist in E. then we call E shadow-closed. Clearly, if 
E is not shadow-closed, then we can make it shadow-closed by adding all sublinks of each of the 
original links. It can be seen that this preserves the optimal value of any instance of TAP. 

Observe that a twin link cannot be a proper sublink of another link, because each end u of a 
twin link has deg r (w) = 1. Thus, when we add sublinks to E to make it shadow-closed, then none 
of the added sublinks can be a twin link. Hence, if we start with a stemless instance of TAP, then 
we do not introduce any stems when we make it shadow-closed. 

Following Even, at ah, see [9, Assumption 2.2], we make the next assumption. 

Assumption: E is shadow-closed. 

Contractions, compound nodes and original nodes. By an original node we mean an element 
of V(T), and by an original link we mean an element of E. 

Recall that the algorithm (incrementally) constructs a set of links F such that, at termination, 
T + F = (V, Et U F) is 2-edge-connected. Throughout, we use F to denote the current solution 
of the algorithm; initially, F := 0. We use the standard notion of contracting a link or a set of 
links, see [9], [4, Chapter 1], Throughout, we use T/F (or, T' := T/F) to denote the “current 
tree” obtained by contracting each of the 2-edge-connected components of T + F = (V, Et U F) to 
a single node. Each of the contracted nodes of T/F is called a compound node , see [9, Section 3.2]; 
thus, each compound node corresponds to a set of two or more nodes of V(T). Each of the other 
(non compound) nodes of T' is called an original node. For this paragraph, given an original node 
v of T, let us use v' to denote the corresponding node of T/F, i.e., if v is an original node of T/F , 
then we have v' = v, otherwise v' denotes the compound node of T/F that contains v. Similarly, 
let us use u'w' to denote the “image” of an original link uw G E w.r.t. the current tree T/F. For a 
set of original links JCB, the image w.r.t. the current tree T/F is {u'w 1 : uw G J,u' w'}. Note 
that original links that have both ends in the same compound node of T/F are discarded since 
they are not relevant for the rest of the execution/analysis. When we discuss the algorithm and its 
analysis (in Sections 6, 7), we may abuse the notation by not distinguishing between an original 
link uw G E and its image u'w' w.r.t. the current tree T/F; similarly, we may not distinguish 
between a set of original links and the set of images of those original links. 

Root, ancestor, descendant and rooted subtrees. One of the nodes r of T is designated as 
the root; thus, we have a rooted tree (T, r). 

Let v be a node of T. If a node w belongs to the path P ViT , then w is called an ancestor of v, 
and v is called a descendant of w. If a descendant w of v is adjacent to v (thus, w 7 ^ v), then w 
is called a child of v, and v is called a parent of w. Clearly, every node (except r) has a unique 
parent. If v has no child, then we call v a leaf ; clearly, if v has no child, then deg r (u) = 1. Note 
that r is not a leaf, even if deg T (r) = 1. Throughout, we use L to denote the set of original leaves; 
we use 1Z to denote V—L, i.e., the set of original non-leaf nodes. 

For any node v, we use T v to denote the rooted subtree of (T, r) induced by v and its descendants. 
(Throughout, we view T as an “oriented tree” rooted at r, and we use the term subtree to refer to 
a rooted subtree.) 

We say that a subtree T v is covered by a set of links J C E if each tree-edge of T v is covered by 
some link of J. 

Property 2.1. Suppose T is a rooted tree. Let T VI and T V2 be two (rooted) subtrees of T. Then 
T V] and T V2 either share no node or one is contained in the other. 
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Proof. Suppose T Vl and T V2 share a node, say w. Then, both v\ and V 2 are ancestors of w. This 
implies that one of v± or V 2 must be an ancestor of the other one. Hence, one of the two subtrees 
T V] , T v2 is contained in the other one (possibly, v\ = V 2 and T Vl = T V2 ). □ 

For any leaf v of T, up(v) denotes a node q in P vr that is nearest to the root and adjacent to v 
via a link; clearly, up(v) exists by the assumptions of feasibility and shadow-closure. 

Vectors and convex combinations. For any vector x £ M E , let ones(x) denote the set of links 
of x-value one, thus ones(x) = {uv £ E : x uv = 1}. 

For a vector x £ and any subset J of E, x(J) denotes Yhe&J x ej and x\j denotes the restriction 
of x to J. Given several vectors v 1 , v 2 , ..., we write one of their convex combinations as Eiez AjiV; 
thus, Z is a set of indices, and we have A* > 0, Vi £ Z, and Eiez A* = 1- 

Remark. For expository reasons, Part I uses definitions of “stem” and “twin link” that are relax¬ 
ations of the definitions in Part II. The two definitions in Part I are independent of the choice of 
the root r. In Part II, we call a node s of the rooted tree (T, r) a stem if s r, s has exactly 
two children, the subtree T s has exactly two leaves, and there exists a link between the two leaves; 
the leaf-to-leaf link is called a twin link. If s is a stem according to Part II, then it must be a 
stem according to Part I, but not vice-versa; a similar statement holds for twin links. Clearly, if an 
instance of TAP is stemless according to Part I, then it is stemless according to Part II as well. 

3. The initial LP 

This section presents our LP (LPq) for TAP. 

Let u\V\ and U 2 V 2 be a pair of links. We call it an overlapping pair of links if (i) P U1 , V1 , Pu 2 ,v 2 
have one or more tree-edges in common, and (ii) either an end of u\V\ is in P U2)V2 , or an end of 
U 2 V 2 is in P U1 , V1 - We call a set of links J an overlapping clique if every pair of links in J is an 
overlapping pair. 

The following LP with constraints on overlapping pairs gives a formulation of shadow-closed 
instances of TAP. 

(LPq) minimize : E %UV 

uv$LE 

subject to : x uv >1 Ve £ Et 

uv^SeIP) 

x Ul vi + x U2V2 <1 V overlapping pairs u\Vi, U 2 V 2 £ E 
0 < x < 1 

Note that it tightens the feasible region of the obvious covering LP for TAP, 
min{x e : x(5e(&)) > l,Ve £ Et, 1 > x > 0}, 

e£E 

because (LPq) has additional constraints for the overlapping pairs. The next result shows that 
(LPq) has the same optimal value as the covering LP, for fractional solutions as well as for integral 
solutions, provided the instance is shadow-closed. 

Proposition 3.1. Consider a shadow-closed instance of TAP. The optimal values of (LPq) and 
the covering LP are the same. Moreover, the best objective value of an integral solution of (LPq) 
is the same as the best objective value of an integral solution of the covering LP. 

Proof. Clearly, any feasible solution of (LPq) is also a feasible solution of the covering LP. We 
claim that there exists an optimal solution of the covering LP that is feasible for (LPq). The first 
statement follows from this claim. 


6 



Let x be an optimal solution for the covering LP that minimizes Y! U veE ^ en S^{Pu,v) ■ x uv , where 
length(P UtV ) denotes the number of tree-edges of P n> „. We show that x is feasible for (LPq). 
Otherwise, suppose that x violates the constraint for an overlapping pair u\V\,U 2 V 2 - W.l.o.g., 
suppose that u\ £ V(P U2 ,v 2 ) and some tree-edge is in both P Ul , Vx and P u2i „ 2 . Then, there is a 
maximal (nonempty) prefix of the edge sequence of P Ul ,vi that is contained in P U2 ,v 2 ', let us denote 
this prefix by P Ul , u *• (Note that the link u*v\ is present because the instance is shadow-closed.) 

Let a denote the original value of x UlVl . Then, we replace the value x UlVl by 1 — x U2V2 , thereby 
enforcing the constraint for the overlapping pair U\V\, U 2 V 2 ■ Moreover, we add a + x U2V2 — 1 to 
the value of x UfVl , and if the new value of x UtVl exceeds 1, then we replace it by 1. It can be 
seen that this preserves the constraints of the covering LP. This procedure decreases the value 
YhuveE l en gth(P u .v )' x uv but does not increase the objective value YIuv^e x uv- This contradicts the 
assumption that x is an optimal solution for the covering LP that minimizes l e ugtii(P M> „) • 

Xuv • 

The last part (on integral solutions) follows from similar arguments, because the above procedure 
maintains integrality. □ 


4. Lasserre tightening and its properties 

We give a brief overview of the Lasserre system, [16]. In fact, our results can be stated and proved 
without going into the formalities of the Lasserre system. Essentially, we apply one well-known 
result about the Lasserre system, namely, the decomposition theorem of Karlin-Mathieu-Nguyen, 
see [12, 18, 19]. The comprehensive recent survey by Rothvofi [19] presents this result and much 
more. 

Let A be an m x n matrix, and let (LP) min{c T x s.t. Ax > b} be a linear programming 
relaxation of a binary integer programming problem; thus, each variable x±,...,x n is binary. We 
use the notation A = (A^)^ e r m i ie r n i. (Below, we may use (LP) to denote both the linear program 
above, and its feasible region; thus LP = {1 6 K" : Ax > b}.) 

An nxn matrix M is called positive semidefinite (p.s.d.) if x T Mx >0, Vi G M n ; M !>= 0 denotes 
that M is p.s.d. 

For a positive integer t and the ground set [n] = {1 ,n}, let Vt denote the family of subsets 
of [n] of size at most t. i.e., Vt = {I : / C [n], |/| < f}; thus, each element of Vt is an “index set” of 
size < t. In this subsection, we use I and J to denote elements of Vt (whereas, in the rest of the 
paper, we use J to denote an arbitrary subset of E(G)). 

The t -th level of the Lasserre system (or, hierarchy), denoted Las*(LP) consists of the vectors 
y £ M 2[nl that satisfy 

M t +\ (y) ■■= (yiuj)i,JeVt+i > °; Mt(y) > 0, £ [m]; 2/0 = 1, 

where M/(y) := I ^(A fi )(y 7u j u{i} ) - (W)(y Iu j) 

\ i =1 

In other words, the (t + l)-th level moment matrix of y. M t +i(y), is required to be p.s.d., and 
moreover, for each of the constraints of (LP), namely, row e(A)x — be > 0, £ £ [m], the associated 
t-t\\ level moment matrix of slacks, Mf(y), is required to be p.s.d. 

The index t of each relaxation in the sequence of tightened relaxations is known as the level ; the 
level of the original relaxation is defined to be zero. For any t = 0(1), it is known that the relaxation 
at level t can be solved to optimality (up to a “small enough” additive error term) in polynomial 
time, assuming that the original relaxation has polynomial size; additional mild conditions may be 
needed. 

Moreover, the relaxation at level n is exact. 


^ I,j£Vt 
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Now, consider our LP (LPo) f° r TAP, and let LASt(LPo) denote the level t tightening of (LPo) 
by the Lasserre system. * Rothvofi, see [18, Theorem 2], formulated the following decomposition 
theorem for the Lasserre system, based on an earlier decomposition theorem due to Karlin-Mathieu- 
Nguyen [12]. (We use this particular formulation and not the original statement of [12]; hence, we 
reference both [12] and [18].) 

Theorem 4.1. Let J C E. Let k be a positive integer such that |ones(x) n J| < k for every feasible 
solution x of (LPq). Then for every feasible solution y G LASt(LPo), where t > k + 1, y can be 
written as a convex combination: y = Yli^z suc -h that x l is in LAS t -k(LPo) and x l \j is integral 
(i.e., x l uv is integral for each uv G J), for all i € Z. 

Lemma 4.2. Let J C E be an overlapping clique. For every feasible solution x of (LPo), we have 
|ones(.x) n J\ < 1. Furthermore, for every level t > 2 and every feasible solution y of LASt(LPo), 
we have y(J) < 1. 

Proof. Let x be a feasible solution of (LPo). Then the overlapping constraints in (LPo) imply that 
| ones(x) n J| < 2. To see this, suppose that |ones(.x) fl J| > 2. Then there exists a pair of links 
u\V\,U 2 V 2 G J, with x uivi = x U2V2 = 1; thus, u\V\,U 2 V 2 is an overlapping pair whose associated 
constraint in (LPo) is violated. 

By Theorem 4.1, y can be written as a convex combination: y = Yliez ^ iX% suc h that x l is 
in LaSi(LPo) and x l \j is integral for each i G Z. Hence, x l (J) < 1 for each i £ Z, because 
|ones(x*) fl J| < 1 and x l \j is integral. Consequently, the convex combination y of x l ,i £ Z, satisfies 
y(J) <1. □ 

Lemma 4.3. Let w be a leaf of T, and let u be an ancestor of iv such that every internal node 
(if any) of P W:U has exactly one child. Let e = vu be the tree-edge of P w ,u that is incident to u 
(possibly, v = w). Then, 8e(&) is an overlapping clique. In particular, 5e(w) is an overlapping 
clique. Moreover, we have y(§E(w)) = 1 for any feasible solution y £ LASt(LPo) where t > 2. 

Proof. Consider any two links f\ q \, /2(?2 £ fe(e). Clearly, each of the links f\q \, /2(?2 must have an 
end in P W;V . Suppose that qi,q 2 are the ends in P u ,. v , and w.l.o.g., assume that q\ is an ancestor 
of q 2 . Then, observe that fiqi, /2(?2 is an overlapping pair, because qi is in Pf 2 , q2 and the tree-edge 
e is in both Pf liqi and Pf 2 , q2 (see Figure 3). Hence, the set of links covering e is an overlapping 
clique. 

If we take e to be the unique tree-edge incident to the leaf w, then it can be seen that 5e(w) = 
5e(I) is an overlapping clique. Moreover, by Lemma 4.2, p(5e(w)) < 1, whereas the constraints of 
(LP 0 ) imply that y(S E (w)) = y(S E (e)) > 1. Therefore, y(S E (w)) = 1. □ 

Recall that the matching polytope of the subgraph induced by the leaves, G(L) = (L,E(L)) is 
given by the following constraints: 

x($E(L){v)) < 1 Vu G L 

x(E(W)) < &ZL 1 VW C L, \W\ odd 
x > 0 

The next result is essentially the result on the matching polytope from the survey of Rothvofi, 
see [19, Lemma 13, Sec 3.3], translated to our setting. 

Lemma 4.4. Let e > 0, and let t > ^ + 1. Suppose that y G Las^LPo ) is a feasible solution to 
the t-th level of the Lasserre system. Then, is in the matching polytope of G(L) = (L, E(L)). 

* Although we defined LASt(LPo) to be a set of vectors in R aI1 11 , in what follows, we abuse the notation and take 
LASt(LPo) to be the projection on the subspace indexed by the singleton sets; thus, we take LASt(LPo) to be a set 
of vectors in 



- h 
/ h 


u 


Figure 3. Illustration of the proof of Lemma 4.3. The solid lines are tree-edges 
and the dashed lines are links. 


Proof. By Lemma 4.3, we have v{Se(L)( v )) < u(^e(v)) < l.Vn € L. Hence, for any set W C L. 
we have y(E(W)) < |W|/2. For “large” odd sets W C L, we will show that y{E[W)) < |W|/2 
implies that 1 , whereas, for non-large odd sets W, we deduce y(E(W)) < ^ W ! 2 1 by 

the decomposition theorem (Theorem 4.1) and local integrality on E(W). 

First, consider odd sets W C L with |IF| > ’ + 1. Clearly, we have y{E(W)) < since 

y{$E(L){v)) < 1,Vn <E L. Also, observe that ^ = ^=^(1 + ppj) < ^-^(1 + j) = 

+ e). Hence, y(E(W)) < + e). 

Now, consider odd sets W C L with \W\ < y + 1- We apply the decomposition theorem, 
Theorem 4.1. Note that for any feasible solution x of (LPq), by Lemmas 4.2, 4.3, we have |ones(.x)n 
E(W) | < \ w \~ x < T < t _ i. Since y is a feasible solution to the t-th level of the Lasserre system, 
y can be written as a convex combination: y = such that x l is in LaSi(LPo) and x 1 \e(w) 

is integral. Note that 6e(w) is an overlapping clique by Lemma 4.3 for every w £ W. Hence, for 
each i £ Z, x l (E(W )) < 1 Consequently, y(E(W)) < \ w ]^ 1 . This completes the proof. □ 


5. Potential function for stemless TAP 
This section presents the potential function that is used in our analysis. 

Let M denote a maximum matching of (L,E(L)); thus, M is a maximum matching of the leaf- 
to-leaf links. By an M-link we mean a link that is in M . Let U denote the set of M-exposed leaf 
nodes, that is, the set of leaves that are not covered by M . 

We will often refer to M and U in the rest of the paper; these are key items for the algorithm 
(Section 6) and its analysis (Section 7). 

Recall that 1Z is the set of non-leaf nodes, i.e., 1Z = V—L. 

Lemma 5.1. Let e > 0 be a constant, and let t > 1 + . Let y £ W E be any feasible solution of 

the t-th level of the Lasserre system, Las^LPo). Then 

(l + e)y(E) > \u\ + l\M\ + ±Y,y( S E(v))- 
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Proof. 


\y{E) = \y{E{L)) + \y{E(L,K)) + \y{E{K)) 

> (2 y(E(L)) - \y{E{L ))) + (y{E{L, K)) + \y{E(L, K))) + y{E(K)) 

= (2 y{E(L))+y(E(L,n)))-^y(E(L)) + {^y(E(L,n))+y(E(n))) 

= X y^E{v)) - ^y{E{L)) + ^ X y( 6 E( v )) 

vGL v£TZ 

> \L\ - ^(1 + e)\M\ + ^ X y^Eiy)) 

v£TZ 

® |C/| + (| - + I !/<<5e(«)), 

v&TZ 

where (1) follows from two facts that i/(5e{v)) = 1 for any v G L by Lemmas 4.3 and y (E(E)) < 
by Lemma 4.4, and (2) follows from the observation that \L\ = \U\ +2\M\. Note that \M\ <\\L\ = 

h T,veLy( S E(v)) < y(E), hence, -§|M| > -ey(E). Thus, we have + e)y(^) > \U\ + ^|Af| + 
z i)e7e 

Lemma 5.1 states a key inequality for any feasible solution of the level -1 tightening of (LPq). 
But, the following example shows that this key inequality does not hold for all feasible solutions 
of (LPq) (without tightening by the Lasserre system). The example has a parameter k. We start 
with a path of length k — 1, then attach a claw (a copy of K\^) at each node of the path. This 
gives the tree T. The link set E consists of a link from one end of the path to the other end, and 
the three links connecting every pair of leaves in each copy of the claw. Note that all the sublinks 
of these links are also contained in E (see Figure 4). 



■k 'k 'k 'k 


Figure 4. Instance for k = 6. The tree-edges of T are indicated by solid lines, and 
the maximal links in E are indicated by dashed lines. 

Define a feasible solution x of {LPq) as follows: the link from one end of the path to the other 
end gets value 1, every link connecting a pair of leaves in each claw gets value ! 2 , and all other links 
get value 0. It is not hard to see that x is feasible for {LPq) and that x{E) = + 1. (Note that 

the optimal value of {LPq) may be < x{E), but our arguments do not use the optimal value.) 

If we pick any node on the path to be the root node, then any maximum matching M C E{L) 
has size k and there are k M-exposed leaves. Thus, ||M| + \U\ = | k, and this quantity is larger 
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than (| + e)x(E) = (| + |e)fc + (| + e) for any e < | and for sufficiently large k. Thus, the inequality 
stated in Lemma 5.1 does not hold for x. 

6 . Algorithm 

This section presents the details of our algorithm and its analysis following the overview given 
in Section 1. The working of the algorithm is illustrated below, see Figure 5. 

We first state our main result for (unweighted) stemless TAP: 

Theorem 6.1. Consider an instance of stemless TAP. Let e > 0 be a constant, and let t > 
max{3, ^ + 1}. The integrality ratio of LaSj(LPo) is < | + e. Moreover, there is a polynomial¬ 
time algorithm for finding a feasible solution of TAP of size < (§ + e)y(E), where y is an optimal 
solution ofLASt(LPo). 

Let y £ be an optimal solution of the f-tli level of the Lasserre system, LASt(LPo), where 
t > max{3, 1 + ^}, where e > 0 is any (small) constant. We take the right-hand-side of the 
inequality in Lemma 5.1 to be our potential function. Thus, our potential function is < (| +e)y{E). 
Our goal is to present an algorithm that finds a set of links F that covers T such that |Fj is < our 
potential function; then, it will follow that \F\ is within a factor of (| + e) of optimal. 

The purpose of the potential function is to provide “credits” to the algorithm. In Section 6.2, 
we distribute the credit (i.e., the potential function) among the nodes and links. 

Recall from Section 1 that the algorithm maintains a set of links F and a current tree T' := T/F. 
Initially, we have F : = 0, and T' := T. 

Consider T' := T/F and the addition of a single link t = vw to F. The new tree T / (Fuji}) can 
be obtained by contracting the unique path of T' between v and w, P'„ w , to a single compound node. 
Besides adding a single link in a major step, the algorithm may add a set of links {i\ = viiv \, £2 = 
V 2 W 2 ,..., Ik = VkWk} such that the union of the paths of T' corresponding to i\, £ 2 , ■ ■ ■, £k forms a 
single connected component, i.e., Uli P' is a connected subgraph of T'. Again, the new tree 
can be obtained by contracting all of these paths into a single compound node. 

The algorithm repeatedly finds a set of links F lter C E—F such that the contraction of F lter in 
the current tree results in a single new compound node, and moreover, the credit available from 
the contraction of F lter is > |T lter | + 1 (the details are discussed below). We add F lteT to F, and 
obtain an updated current tree T'. The algorithm repeats this until T' is a single node, that is, 
until T + F is 2-edge-connected. 

There are two types of link sets that get contracted by a major step. The first type is a singleton 
set, i.e., the major step adds one link to F. The second type is defined via the notion of a semiclosed 
tree. This is discussed in the next subsection. 

6.1. Semiclosed trees. We start by defining the key notion of a semiclosed tree w.r.t. an arbitrary 
matching of the leaf-to-leaf links. This notion is due to Even, et ah, based on earlier work by 
Nagamochi [10]; also, see [9, Definition 2.3]. 

Let T' v be a rooted subtree of the current tree T' = T/F. Let M be an arbitrary matching of 
the leaf-to-leaf links. T' v is called semiclosed w.r.t. M if the following conditions hold: 

(i) Each link in M either has both ends in T' v or has no end in T' v . 

(ii) Every link incident to an M-exposed leaf of T' v has both ends in T' v . (Thus, if T' v / T', 
then none of the links covering the tree-edge between v and its parent is incident to an 
M-exposed leaf of T' v .) 

Let M ( T' v ) denote the set of links in M that have both ends in T' v . 

We define 

Y[M,T v ) := M(T/) \^J{up(w)w : w is an M-exposed leaf of 7j(}; 

11 



thus, we associate a “basic link set” with the pair M, T' r In general, the “basic link set” may not 
be a cover of T' v . 

By a minimally semiclosed tree T' v we mean that T' v is semiclosed but none of the proper rooted 
subtrees of T' v is semiclosed. 

Lemma 6.2 (Even, et ah, [9]). Let T' v be a minimally semiclosed tree w.r.t M. Then T{M,T' V ) 
covers all the tree-edges ofT' v . 

Proof. (The proof is well known, but we include it for the sake of completeness.) Suppose that 
some tree-edge e = pq is not covered by T(M ,T' V ), where q is a child of p. Observe that pq cannot 
be incident to a leaf of T' v since T(M,T' V ) has a link incident to each leaf of T[ ; . Then, consider the 
rooted subtree T " q . It can be seen that T' q is a semiclosed tree w.r.t. M, otherwise, T(M,T’ I1 ) would 
cover e = pq. This contradicts the fact that T' v is minimally semiclosed. □ 

6.2. Credit assignment. Recall that M is a maximum matching of ( L,E(L )), U is the set of 
Af-exposed leaf nodes and 1Z is the set of (original) non-leaf nodes in T. 

We start with the credit given by the potential function of Lemma 5.1, and we maintain the 
following assignment of credits to the nodes of T' := T/F and the links of M: 

• every M-exposed original leaf has one credit, 

• every compound node has one credit, 

• every (original) node v £ 1Z is assigned \y{5E{v)) credit, 

• every M-link has | credit, and 

• the (original) root r has one credit. 

It can be seen that the potential function of Lemma 5.1 suffices for assigning credits to the initial 
tree T' := T, except for the unit credit for the root r. When the algorithm terminates, the tree T' 
becomes a single compound node with one credit. But, this credit will not be used any more, and 
thus we have a surplus of one credit. We assign this surplus credit to the root r at initialization. 

We mention that the nodes or links that get contracted into a compound node are no longer 
relevant for the algorithm or the analysis. In particular, the credit (if any) of such nodes or links 
may be used at the step when they get contracted into a compound node, but after that step, any 
remaining credit of such nodes or links is not used at all. 

We take the credit of a link i w.r.t. the current tree T' to be the credit of the original link 
corresponding to i. 

6.3. Simple contractions and assertions on M . Let t = uw be a link, where u, w are nodes of 
the current tree T', and let P' uw denote the path of T' between u and w. We call i = uw a good 
link if the sum of the following items is > 2: (i) the credit of I = uw, (ii) the number of compound 
nodes in P' uw , (iii) the number of Af-exposed original leaves in P' u w , and (iv) 1 if the root r is an 
original node of P' t w . In other words, if we take the credits associated with y to be “fractional 
credits,” then uw is a good link if the “non-fractional credits” associated with uw and the nodes 
of Pf, w is > 2. 

We define a simple contraction to be one of the following types of single-link contractions. 

• For the current tree, consider a leaf-to-leaf link uw such that each end owns one credit; 
thus each of u, w is either a compound leaf node or an original leaf node that is Af-exposed. 
Observe that uw is a good link. 

• For the current tree, consider an Af-link uw such that the path between u and w in the 
current tree contains at least one compound node. Again, note that uw is a good link. 

Lemma 6.3 (Assertions on Af). Suppose that no simple contractions are applicable. Then 

(1) For every M-link uw, every node in the path between u and w in T' is an original node. In 
particular, in T', both ends of each M-link are original leaf nodes. 
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(2) There exist no links between M-exposed leaves. 

6.4. Good semiclosed trees. For the rest of the paper, unless mentioned otherwise, a semiclosed 
tree means a tree that is semiclosed w.r.t. the matching M. 

Recall that a semiclosed tree is defined w.r.t. an arbitrary matching of the leaf-to-leaf links. We 
chose M C E to be a maximum matching of the leaf-to-leaf (original) links of the (original) tree 
T. But, it is not obvious that the the “image” of M in the current tree T' is a matching of the 
leaf-to-leaf links of T'. 

The image of M w.r.t. T' is {u'w' : uw £ M. u' / w'}. We abuse the notation and use M to 
denote both M and its image w.r.t. T ', and by an M-link of T' we mean the image of an original 
Af-link w.r.t. T'. Whenever we mention semiclosed trees w.r.t. M, we assume that no simple 
contractions (see Section 6.3) are applicable in the current tree T'. Then, Lemma 6.3(1) implies 
that M is a set of leaf-to-leaf links w.r.t. the current tree T'. Hence, semiclosed trees w.r.t. T' and 
M are well defined. 

Let T' v be a rooted subtree of T'. We use Af(T') to denote the set of M --links of T' that have 
both ends in T' v . We use U{T' V ) to denote the set of M-exposed leaves of T ' v , including both original 
leaf nodes and compound leaf nodes. Let C(T ' V ) denote the set of compound non-leaf nodes of T' v . 
Moreover, for any vector x £ W E , we use 4>(a;, Tf) to denote \ Y2 w eV(T')mz x (^E{w)). We define 
the credit of T’ v to be the sum of the credits of the nodes in T’ v plus the sum of the credits of the 
links in M(T' V ). 

Observe that the credit of a semiclosed tree T' v is either 1 + || M (T^)| + |t/(T^)| + |C'(T/,)| + $(y, T ' v ) 
(if r £ V(T' v )lMZ),or l\M(T^)\ + \U(T^\+\C(T^\+^{y,Tf) (if r (£ V{T' v )C\IZ). We call a semiclosed 
tree T ~" v good if its credit is > |F(Af, T' v )\ + 1. 

Lemma 6.4. Let T' v be a semiclosed tree. If at least one of the following conditions is satisfied, 
then T' v is good. 

• T' = V 

• c\t') / 0 

• |M(T')| >2 

• HvX)> 1 

. |M(T')| = 1 and $(y,T’) > l 

Proof. First, suppose that the root r is not in V(T,l) nIZ. This implies that if T' v = T', then r must 
be contained in some compound non-leaf node, hence, C(T' V ) 0. Then, it can be seen that the 

difference between the credit of T' v and + 1 is 

= l\M(T' v )\ + \U(T’)\ + \C(T’)\ + <%,T') - |r(M,T')| - 1 

= \\M(T' V )\ + \U(T')\ + \C(T' V )\ + 4>(y, T' v ) - |M(T')| - \U(T’)\ - 1 

= ^|M(T')| + |C(T')| + $(y,T')-l. 

The above quantity is > 0 if any one of the conditions listed in the lemma is satisfied, hence, the 
result holds. 

If the root r is in V{T' v ) n IZ , then T' v has one more unit of credit, and again it can be seen that 
the result holds. □ 

Consider a minimally semiclosed tree T' ; and suppose that it is good. Then, by Lemma 6.2, 
r(M, T' v ) is a cover of Tand moreover, T ” v has enough credit to pay for the contraction of T(Af, T' v ). 

^Recall that IZ is the set of original non-leaf nodes of T; thus V (T^) H IZ denotes the set of nodes of T' v excluding 
all leaves and all compound nodes. 
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(Informally speaking, a naive algorithm could make progress whenever there exists a minimally 
semiclosed tree that is good.) 

6.5. Algorithm in summary. We give a summary of the algorithm in pseudocode. The critical 
step of the algorithm is to find a good semiclosed tree and a cover of it of appropriate size in 
polynomial time. The details of this step are presented in Section 7. There, we show that if a 
semiclosed tree is not good, then T' ); and its incident links form a subgraph that we call a 
deficient 3-leaf tree (this is the key result of Part I). The algorithm finds all occurrences of deficient 
3-leaf trees in polynomial time, and then computes an auxiliary matching of the leaf-to-leaf links 
of T' that we denote by M new . Then the algorithm finds a minimally semiclosed tree T', w.r.t. 
M new . We prove that T' ; is good (it has enough credits to pay for the contraction of a cover of 
size \T(M,Ty)\) and, moreover, F(M new , T' v ) is a cover of T' v of size |T(M, T^)\. (We mention that 
M new is an auxiliary matching that is used for finding a semiclosed tree T, £ with desired properties, 
and other than that, the algorithm does not refer to M new at all; whereas, the matching M and its 
image M(T') are used throughout the algorithm and its analysis.) 

The algorithm starts with F := 0 (F is the set of links picked by the algorithm) and T' := T (T' 
is the current tree T/F ). 

while T' is not a single node do 

repeatedly apply simple contractions until no simple contractions are applicable; 

find a good semiclosed tree T' ; with a cover J of size |T(M, T^)\ (Algorithm 2 in Section 7 

gives the details for finding such a good semiclosed tree) ; 

add J to F, contract T' v to a new compound node, update T'\ 

end 

Algorithm 1: Find an approximately optimal solution for TAP. 


compound node b 

compound node a . 

. VO Vl . V2\ 



Figure 5. The edges of the tree T rooted at r are indicated by solid lines, and the 
maximal links in E are indicated by dashed lines. The matching M is indicated by 
the thick dashed lines. 

6 .6. Worked example. We give an informal presentation of the working of the algorithm, and 
illustrate it in Figure 5. (Section 8 discusses the working of the formal algorithm on an example.) 
Observe that M consists of the two links uqui, U3U4. Moreover, note that no simple contractions 
apply at the start, and the subtree rooted at v \, T[ n = T Vl , is a good semiclosed tree. In the first 
iteration (of the while loop), T V1 is contracted into the compound node a by adding the M-link 
uqUi and the link uofi (shadow of vqUq) to F; thus, a corresponds to T’ }] . Consider the credits for 
this iteration. The M-link uqUi has § credit and the M-exposed leaf no has 1 credit. It can be 
seen that $(y, ) > ^i/(5e{vi)) > and this gives ^ credit. (The formal algorithm does not refer 

to the “fractional credits” <!>(•, •), but our analysis relies on these; since we have not presented the 
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formal algorithm in full, we refer to <&(-, •) to justify the working of the algorithm.) Hence, we have 
> 3 credits, and this pays for contracting the two links and for assigning 1 credit to the compound 
node a. Next, the subtree rooted at V2 (that has two leaves a,u 2 ) is contracted into the compound 
node 6 via a simple contraction applied to the link aii2] thus, b corresponds to the subtree rooted 
at t >2 ■ Consider the credits for this simple contraction. Each of a and 112 has 1 credit, and these 2 
credits pay for contracting one link and for assigning 1 credit to the compound node b. After this, 
the current tree T' has three leaves, 6 , 113 , 114 . Note that T' is a good semiclosed tree. In the final 
step, T' is contracted into a single compound node by adding to F the links 113114 and br (shadow 
of bu4 corresponding to the original link 1 x 21 x 4 ). Consider the credits for the final step. Each of b 
and r has 1 credit, and the M- link 1x31x4 has | credits. Hence, we have 3 ^ credits, and this pays 
for contracting the two links and for assigning 1 credit to the resulting compound node. Thus, the 
algorithm computes the solution F = {vqV\,uqU\, au2 = v\U2, br = v,2r , 1 x 31 x 4 } of size five; it can be 
seen that there exists an optimal solution of size four. 

7. Analysis of the algorithm 

This section has our main result. Informally speaking, it asserts the following: if a semiclosed 
tree T' v is not good, then T' v (and its incident links) form a deficient 3-leaf tree. 

We assume that we are considering the moment after exhausting simple contractions in the main 
loop of the algorithm. Thus, Lemma 6.3 applies. The analysis mainly consists of two parts. In 
Section 7.1, using local integrality of feasible solutions to the Lasserre system, we show that all 
semiclosed trees are good, except one particular case that gives deficient 3-leaf trees. Section 7.2 
shows how to handle deficient 3-leaf trees. This leads to a polynomial-time algorithm for finding a 
good semiclosed tree T' v and a cover of T' v of size |L(M, Tf)\. 

Deficient 3 -leaf trees. Even, et al., introduced the notion of “deficient trees”, see [9, Defini¬ 
tion 4.7] and Figure 1 of [9]; each of these configurations consists of a rooted subtree with three 
leaves and some incident links. 

Suppose that T' v is a semiclosed tree with exactly three leaves a, 61 , 62 - Clearly, among the nodes 
w of T' v either there is exactly one node with deg t'( w ) = 4 or there are two nodes with degree 3 
in T'. In the latter case, we denote these two nodes by u and q\ moreover, we fix the notation 
such that ix is an ancestor of q, and the leaf b\ is not a descendant of q ; thus, a, 62 (but not 61 ) 
are descendants of q. In the former case, we denote by u the unique node that is incident to four 
tree-edges. We call T' v a deficient 3 -leaf tree if (i) the link 6162 is present and it is in M, (ii) the 
link ab\ is present, and (iii) there exists a link b^w such that w G V(T')—V(Tf). 

Moreover, in the first case (with a unique node u in T 1 ' with deg T /(tx) = 4), if conditions (i)-(iii) 
hold with both labelings ( 61 , 62 ) and ( 62 , 61 ) of the M-link, then we fix the notation such that 
upifif) is an ancestor of up( 6 i). We call 62 the ceiling leaf of the deficient 3-leaf tree. Hence, for 
any deficient 3-leaf tree T' v with ceiling leaf 6 , it can be seen that up(b) is a proper ancestor of v, 
i.e., up( 6 ) is not in T' v . 

We mention that the leaf a may be an original node or a compound node; the properties of the 
algorithm (see Lemma 6.3) ensure that the leaves 61,62 must be original nodes. Note that deficient 
3-leaf trees are defined w.r.t. M. 

7.1. Semiclosed trees are good except deficient 3 -leaf trees. Let T' v be a semiclosed tree. 
We construct an auxiliary graph (in fact, a multigraph, see below) in order to analyze the credits 
available in T' v . We denote the auxiliary graph by AG(Tf). This is a bipartite (multi) graph, and the 
two sets in the node bipartition are denoted by ML(T^) and AUfiFf). The first set consists of the Un¬ 
covered leaves of T',. The second set contains an auxiliary node v (informally speaking, v represents 
the node set V{T')—V(T' v )), as well as all the M-exposed leaves of Tfi thus, AU{T' V ) = {u}U U(T' V ). 
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Figure 6. Illustration of deficient 3-leaf tree. 

We define the edge set of AG(T() as follows: for every \mkpq (w.r.t. T') with p E ML{T' V ), q G U(T' V ), 
the edge pq is in AG(T ' V ), and for every link pq (w.r.t. T') with p G ML{T' V ), q G V(T')—V(T' v ), the 
edge pv is in AG(T' V ). Thus, AG(T' V ) is a multigraph, and every edge in AG(T() corresponds to a 
link (w.r.t. T'). See Figure 7 for an example. 



Figure 7. The left figure shows a semiclosed tree T' v where dashed lines indi¬ 
cate links incident with leaves and the thick dashed line indicates an M-link. The 
right figure shows the auxiliary graph AG(T() and its node bipartition ML(T' V ) = 
{b 1 ,b 2 },AU(T') = {v,a}. 


Lemma 7.1. Suppose that no simple contractions are applicable. Let T’ ; be a semiclosed tree such 
that T( / T', C(T() = 0, and \M(T()\ < 1. Let x be a feasible solution for (LPq). 

(1) If M(T'.) = 0, then d>(x,T^) > 1. Furthermore, T' v is good. 

(2) Suppose that \M(Tf)\ = 1, and \U(T{,)\ > 1. Moreover, suppose that x is integral on 
^w£ML(t;,)^e{w) and that d>(x,T^) < Then, \U(T^)\ = 1 and the auxiliary graph has 
a perfect matching such that the corresponding links cover (all the tree-edges in) T' v , and 
moreover, x(£) = 1 for each of the links l in the perfect matching. 

Proof. We start by stating and proving a key claim. 


Claim 7.2. Let J be a set of links that each have at least one end in T' v and no end at 
an M-covered leaf of T' v ; thus, each link in J has at least one end in V(T' V )—ML(T' V ). 

Then, we have &(x,T() > \x(J). 

This claim follows from the fact that every link in J has an end in V(T() n 77. To see this, first 
note that C(T() = 0, so every non-leaf node of T' v is in V(T' v ) n77. Consider any link i G J; clearly, 
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I has no end at an Af-covered leaf of T' v . If i has an end at a non-leaf node of Tf, then we are 
done. Otherwise, l has an end at an Af-exposed leaf of T' v . Since T' v is semiclosed, i cannot have 
an end in V{T')—V{T' v ). Moreover, by Lemma 6.3(2), no link has both ends at Af-exposed leaves. 
It follows that i has one end in V(T' v ) 0 77. Thus, we proved Claim 7.2. 

Let e v denote the tree-edge between v and its parent; e v is well defined since T' v T'. Let 
J = S E (e v ) U (U u€U (t^e{u)). Then, x(J) = x(S E (e v )) + E« e t/(Tp x ^e(u)) > 1 + \U(T^)\; the 
equation holds because (i) T' v is semiclosed so none of the links in 5 E (e v ) is incident to an Af-exposed 
leaf of T' v , and (ii) by Lemma 6.3(2), no link has both ends at Af-exposed leaves; the inequality 
holds because x(d E (e)) > 1 for every tree-edge e. 

Now, consider the first statement of the lemma. Observe that Af (Tfi) = 0; also, U(Tf) 0 since 
T,y has one or more leaves. By Claim 7.2, we have 4>(x,Z£) > \x{J) > \{1 + \U(T^)\) > 1. Since 
this inequality holds for every feasible solution x of (LPq), it also holds for the feasible solution y 
of Theorem 6.1. Thus, we have <h(y, Tf) > 1. Hence, by Lemma 6.4, T' v is good. 

Finally, consider the second statement of the lemma. Observe that \M(T^)\ = 1 and |C/(T^)| > 1. 
Note that x is integral on U w e ml ( T'.)&e( w )-, hence, every link in this set has x-value 0 or 1. 

Moreover, by Lemma 6.3(1), every Af-covered leaf is an original node. For every w G ML(T ' ; ), 
since S E (w) is an overlapping clique by Lemma 4.3 and x is integral on 5 E (w ), we have x(d E (w )) < 1. 
Therefore, x(6 e (w)) < \ML(T^)\ = 2. 

Let J = J— U«,gml(T') 5 E {w). Then, Claim 7.2 holds for J, hence, 4>(x,Ty) > \x{J). 

Note that x(J) = x(S E (e v )) + J2 u eU(T') x (^e{u)) > 1 + |I7(T^)|; this inequality is the same as 
the inequality used above. Moreover, 

(1) x(J) > x{J) - Y. n J ) > 1 + \ U ( T v)\ - > m? v )\ - 1. 

weML(Tl) 

Thus, we have 4>(x, T’ v ) > \x(J) > \{\U{T' V )\ — 1). 

Clearly, \U(T' V )\ = 1, otherwise, we would have 4>(x', T ? () > f, thus giving a contradiction. Hence, 
we have \ML{T^)\ = 2 = |A17(2^)|. 

Similarly, we claim that each Af-covered leaf w of T' v has a link i w in d E (w) n J of x- value one; 
otherwise, we would have Y1 w &ml(T') x ^e{w) H J) < |AfL(T' l / ,)| — 1 since every link in 6 E (w) fl J 
takes x-value 0 or 1. This would give the same contradiction (see (1) above). 

We claim that the set of links {£ w : w G ML(T' V )} maps to the desired perfect matching 

AM of AG(T'j). Otherwise, one of the nodes of AU(T' V ) would be incident to two links from 
{i w : w G ML(T. £)}, and we would have x(J) = x(S E (e v )) + J2ugU(t;,) x ($e(u)) > 2 + \U(T^)\, and 
this would give the same contradiction (see ( 1 ) above). 

Finally, we claim that T' v is covered by the set of links {£ w : w G AfL(T^)} that maps to AM. 
Otherwise, there exists a tree-edge e of T' v that is not covered by this set of links. Let 5 + (e) denote 
the set of links with positive x-value that cover e. Clearly, each link in <5 + (e) has an end in T/„ and 
moreover, has no end in ML[T. £); to see the latter assertion, note that each node w G AfL(T l / ,) has 
x(5 e (w)) < 1 and x(£ w ) = 1, i.e., the nodes in ML(T' V ) are already “saturated” by the set of links 
that maps to AM. Thus, Claim 7.2 applies to 5 + (e) and we have $(x, T' v ) > ^x(6 + (e)) > 5 , giving 
the same contradiction. □ 

Theorem 7.3. Suppose that no simple contractions are applicable. Let T' v be a semiclosed tree that 
is not good. Then T' v is a deficient 3-leaf tree. 

Proof. Since T' v is not good, Lemma 6.4 implies that C{T') = 0, |Af(T/,)| < 1, and T' v T'. Then 
by Lemma 7.1(1), we further have \M{Tfi)\ = 1. Hence, by Lemma 6.4 again, 4>(y,Ty) < \. 

Let J = U W £ML(Tpd E {w). Note that any node w in ML{T' V ) is original, by Lemma 6.3(1). For 
any feasible solution x G M. E of (LPo), we have |ones(x) n J\ < 2 since |AfL(T')| = 2. Hence, by 
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Theorem 4.1, and the fact that t > 3, y can be written as a convex combination Yliez ^ ixl suc h 
that x l G LASi_ 2 (LP 0 ) and x l is integral on J for each i G Z. If each x l , i G Z, has ^(x l 1 T! u ) > 
then, since y is a convex combination of the x\ we have 4>(y, T^) > This gives a contradiction. 
Hence, there exists an %q G Z such that &(x l ° , T' ; ) < 

Case (1): \M(T' V )\ = 1 ,U{T' V ) = 0. Let uw G M(T ' V ), i.e., uw is an M-link with both ends 
in T' ; . There exists no compound node in the path between u and w in T ', by Lemma 6.3. 
Then, it can be seen that T' ; contains a stem with uw as a twin link. This is a contradiction, 
because we have an instance of stemless TAP. 

Case (2): \M(T' V )\ = 1, \U(T^)\ > 1. By Lemma 7.1(2), we know \U(T^)\ = 1. (For conve¬ 
nience, we will label the nodes of T' v using the same labels as in Figure 6 but our arguments 
do not rely on these particular labels.) We denote the M-exposed leaf by a, and the two 
M -covered leaves by b\, & 2 , i.e., ML(T ' V ) = {b\, 6 2 }. Now, our goal is to show that T' v satisfies 
all the conditions of a deficient 3-leaf tree. 

Since T' v / T', let e denote the tree-edge between v and its parent. By Lemma 7.1(2), 
there exist two links £ v G <5g(e) and £ a £ ^e(a) such that x l °(£ v ) = x l °(£ a ) = 1, these two 
links cover T' ; , and moreover, each of 6 i, 6 2 is incident to exactly one of these two links 
(since the auxiliary graph has a perfect matching formed by these two links). 

If there is only one non-leaf node u in T' v with deg T ,(u) ^ 2 (see Figure 6 (a)), then 
we are done. Otherwise, we have exactly two non-leaf nodes u,q in T' 0 with deg T /(u) ^ 2 
and deg T ,(q) / 2. In fact, we must have deg T ,(u) = 3 = deg T /(^) since T* has exactly 3 
leaves. W.l.o.g., we assume that u is an ancestor of q. Then, T' q has only two leaves. By 
the argument in Case (1), the M-link in T' v cannot have its two ends at the two leaves of 
T'. This implies that one leaf of T q is Af-exposed; thus a is a leaf of T'. W.l.o.g., take the 
other leaf of T' q to be 6 2 . Thus, the third leaf b\ is not in T'. 

Suppose that £ v is incident to b\ and £ a is incident to 6 2 . Then, the tree-edge between q 
and its parent is not covered by these two links (see Figure 8 (a)). This is a contradiction. 
Hence, £ v is incident to & 2 and £ a is incident to b\ (see Figure 8 (b)). Therefore, T' v satisfies 
all the conditions of a deficient 3-leaf tree. 

□ 



7.2. Addressing deficient 3 -leaf trees. Even, et al., see [9, Section 4.3], presented an elegant 
method for addressing deficient 3-leaf trees. We use essentially the same method in this section. 
The key point is to compute an auxiliary matching of the leaf-to-leaf links of T' denoted by M new , 
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and then to find a minimally semiclosed tree T' v w.r.t. AI new . (As mentioned in Section 6.5, M Dew 
and M have different purposes, and M stays the same throughout the execution.) 

For a deficient 3-leaf tree T' w , if T' w is not a proper subtree of another deficient 3-leaf tree, then we 
call Tf, a maximal deficient 3 -leaf tree. By Property 2.1, any two different maximal deficient 3-leaf 
trees are disjoint. To construct M new . we start with M new := M, then we examine each maximal 
deficient 3-leaf tree T' w and we replace the unique link of M(T^) in M new by another leaf-to-leaf 
link. In more detail, consider any maximal deficient 3-leaf tree Tfi, and let the three leaves be a, b, d, 
where a is M-exposed, b is the ceiling leaf, and bd is the unique link in M(T' W )\ we keep the link ad 
in M new instead of the M-link bd (see Figure 9). Since any two different maximal deficient 3-leaf 
trees are disjoint, this replacement takes place independently for each maximal deficient 3-leaf tree. 



Figure 9. Addressing deficient 3-leaf trees by replacing M by M new . Figure (a) 
shows a maximal deficient 3-leaf tree Tf with ceiling leaf b and M-ex posed leaf a. 

We obtain the matching M new from M by replacing the link db by the link da; see 
Figure (6). Tf, is not a semiclosed tree w.r.t. M new since b is M new -exposed. Instead, 

Tp is a minimally semiclosed tree w.r.t M new . Note that T' p is not a deficient 3-leaf 
tree. 

Theorem 7 . 4 . Suppose that no simple contractions are applicable. Let T.'’ v be a minimally semi¬ 
closed tree w.r.t. M new . Then T{. is a qood semiclosed tree w.r.t. M and T’ has a cover T(M new , T’ ) 
of size |r(M,T')|. 

Proof. We start by stating and proving a key claim. 

Claim. If Ty has a node in common with a maximal deficient 3-leaf tree, then Tf 
properly contains this maximal deficient 3-leaf tree. 

Let Ty share a node with a maximal deficient 3-leaf tree T^; let b denote the ceiling leaf of T' w . 
Note that up(b ) is not in T’ w . Observe that b is an M new -exposed node. By the definition of deficient 
3-leaf tree, no subtree of T' w is semiclosed w.r.t. M new . This implies that T’ v cannot be a subtree of 
T' w . Then, by Property 2.1, Ty properly contains T' w . This proves the claim. 

The claim shows that either Ty is disjoint from every maximal deficient 3-leaf tree, or Tf properly 
contains some maximal deficient 3-leaf trees. Thus, Tf cannot be a deficient 3-leaf tree. 

By the construction of M new and the fact that Ty is semiclosed w.r.t. M new . it can be seen that 
Ty is semiclosed w.r.t. M. Moreover, it can be seen that | M(Tf )| = \M Dew {Ty)\ , and hence, we have 
|F(M,P')| = \T(M new ,T' v )\. 

Since Tf is a minimally semiclosed tree w.r.t. M new , Lemma 6.2 implies that T(M new ,Ty ) is a 
cover of T' v . We already showed that the size of T(M new , Tf) is equal to |T(M, 

Finally, observe that Ty is semiclosed w.r.t. M and Ty is not a deficient 3-leaf tree, hence, 
Theorem 7.3 implies that T ? ( is good. This completes the proof. □ 
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The procedure for finding a good semiclosed tree is summarized in the following pseudocode. 


start with M new := M; 

for each maximal deficient 3 -leaf tree T' w do 

let b be the ceiling leaf, a be the M-exposed leaf, and db be the M-link in Tfi; 
update M new by replacing db by da (M new := M Dew —{db} U {da}); 

end 

(note that M new is a matching of the leaf-to-leaf links of T'); 
find a minimally semiclosed tree T,' u w.r.t. M new ; 

T'j is a good semiclosed tree w.r.t. M with a cover T(M new . Tfi) of size \T(M,T^)\ by Theorem 
7 - 4 ; 

Algorithm 2 : Find a good semiclosed tree by addressing all deficient 3-leaf trees. 

The discussion above shows how to find a good semiclosed tree T' with a cover of size |T(M, T’ v )\ 
in polynomial time, for the main loop in our algorithm in Section 6 . Therefore, Algorithm 1 (the 
overall algorithm) runs in polynomial time and returns a solution for TAP with size < (| +e)y(E). 
This proves Theorem 6.1. 

It can be seen that Algorithm 1 (the overall algorithm) can be implemented to run in time 
0(|H| 3 ), assuming that the number of (multi) links in the input is 0(|H| 2 ). This can be achieved 
using simple data structures such as adjacency matrices and adjacency lists; the details are not 
straightforward, but we skip them. 

Theorem 7 . 5 . Algorithm 1 can be implemented to run in time 0 (|H| 3 ). 

8. Tight example for the analysis 

In this section, we present a tight example to show that the approximation guarantee of our 
algorithm cannot be improved beyond |. The example has a parameter k. It consists of one initial 
block and k copies of a repeated block. See Figure 10. 


contract <110,2 to form a 



Initial Block Repeated Block 


Figure 10. Two building blocks of our example. The maximal links are indicated 
by dashed lines. 

Figure 11 shows an instance with k = 3 copies of the repeated block and all maximal links 
in E are shown in the figure. Clearly, our instance has no stems. An instance for large k can 
be constructed by adding more copies of the repeated block. The root node r is always in the 
rightmost copy of the repeated block. 

In our example, the blocks are disjoint in terms of both tree-edges and links. This implies that 
any feasible solution of TAP must cover each block individually. To find an optimal solution, we 
only need to consider each block individually. For the initial block, we need two links to cover the 
three tree-edges. For the repeated block, we again need two links to cover the six tree-edges (see 
the links ai& 2 , 0261 in Figure 10). Hence, an optimal solution of our instance has size 2 k + 2. 
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initial block 


repeated block 


Figure 11. Instance for k = 3. The tree-edges of T are indicated by solid lines, 
and the maximal links in E are indicated by dashed lines. The maximum matching 
M is indicated by thick dashed lines. 


Now consider the execution of our algorithm on this instance. The maximum matching M is 
shown in Figure 11. At the start, there is no good link available for simple contractions. The 
initial block is a minimally semiclosed tree (but not a deficient 3-leaf tree). So this block will be 
contracted via the M-link in it and the link from the Af-exposed leaf to the root of the block. After 
that, we enter into the repeated block (see Figure 10). At this moment, the link aia 2 connects two 
Af-exposed leaves, which implies it is a good link; note that there are no other good links. Hence, 
we apply a simple contraction on the link a±a 2 to form a compound node a (see Figure 10). Then, 
the repeated block forms a minimally semiclosed tree T' v with 3 leaves a, hi, 62 and one Af-link 62^1 
(see Figure 10). Note that T' v is not a deficient 3-leaf tree since there exists no link between an 
Af-covered leaf of T' v and a node not in T' v . Thus T,* is good. In the next step, T/, will be contracted 
via the two links av (shadow of ab\) and 6162 . After that, we enter into another repeated block, and 
we apply the same steps as for the previous repeated block. The algorithm applies these iterations 
till it terminates. 

During the running of the algorithm, we use 2 links for the initial block and 3 links for each 
repeated block (one link for simple contraction and two links for contracting a good semiclosed tree 
with 3 leaves). Hence, the algorithm returns a solution of size 2 + 3 k. Therefore, the approximation 
guarantee of the algorithm is |A|^. When k is sufficiently large, the approximation guarantee 
approaches §. This shows that the approximation guarantee of our algorithm cannot be improved 
beyond |. 

Note that the above instance of TAP has some cut nodes. But, we can modify the construction 
to get a 2-node-connected instance by adding some links to the above instance. We add a link 
from the leaf incident with both maximal links in the initial block (see node 112 in Figure 10) to 
the non-leaf child of the root of the first repeated block (see node 63 in Figure 10). Moreover, 
for each pair of consecutive repeated blocks, we add a link between the non-leaf children of their 
roots. It can be seen that the addition of these links does not change the working of the algorithm. 
Clearly, the addition of these links cannot increase the size of an optimal solution. It follows that 
the approximation guarantee of the algorithm is at least 2 + 2 k ‘ i even f° r 2 -node-connected instances. 

Proposition 8.1. The instance presented above shows that the algorithm in Section 6 cannot 
provide an approximation guarantee better than |. 

9. A preview of Part II 

Our algorithm and analysis for TAP in Part II follow the same global scheme as in Part I, that is, 
we start by deriving a potential function, then we design an algorithm based on the same two types 
of contractions (greedy contractions and the contraction of good semiclosed trees), and we prove a 
similar key assertion (if a semiclosed tree is not good, then it has a particular configuration, namely, 
a deficient tree); the key assertion gives us the algorithmic tools for finding a good semiclosed tree. 



Unfortunately, there are several “small configurations” that cannot be handled directly by this 
scheme, hence, the scheme has to be modified; both the potential function and the algorithm are 
modified, hence, our analysis becomes more complicated. 

Let us illustrate some aspects of the cascade of difficulties by discussing the most obvious dif¬ 
ficulty, namely, stems and twin links; let us apply the potential function (and credits) of Part I, 
that is, the right-hand side of the inequality in Lemma 5.1. Consider a stem s and the subtree T s ; 
recall that T s has exactly two leaves and there is a so-called twin link £ s between these two leaves. 
Suppose that the matching M contains £ s \ thus, t s is an M-link and it has | credits. Moreover, 
suppose that no other credits are available in T s (this is possible, because all the links £ with 
y(£) > 0 may have no ends in T s other than the two M-covered leaves of T s ). Then, we cannot 
apply a (greedy) simple contraction to l s because a simple contraction requires 2 credits. Neither 
can we contract the semiclosed tree T s since it is not good. 

To get around this difficulty, Nagamochi [10] and Even et ah, [9], redefine M to be a maximum 
matching of the leaf-to-leaf links that are not twin links; we follow this plan in Part II, but in 
addition to excluding twin links from M, we exclude another type of link, the so-called buddy links. 
Buddy links are defined in Part II; they are associated with a particular configuration that consists 
of a tree T v with exactly three leaves and the links incident to T v ; each buddy link is associated 
with a leaf, and this leaf is called a bud. Thus, M is a maximum matching of a subset of the 
leaf-to-leaf links, where the subset excludes twin links and buddy links. 

Redefining M in this way affects the potential function. Recall that the coefficient of an M-link 
is | and the coefficient of an M-exposed node (a node in U ) is one, see Lemma 5.1. Observe 
that each unit “decrease” in |M| “increases” the number of M- -exposed nodes, \U\, by 2, and thus 
the potential function would “increase” by We cannot allow the potential function to increase 
(otherwise, Lemma 5.1 could fail since the potential function could exceed (| + e)y(E)), hence, we 
compensate for the increase via the term 2 Y^vgtz v(Se(v)). 

Informally speaking, in Part II, a twin link i s contributes 2 y(£ s ) to the potential function (rather 
than | y(£ s )), and we “pay” for the extra \y(£ s ) from the y- values of the links incident with the 
associated stem s of £ s \ similarly, a buddy link £ contributes 2 y(£) to the potential function, and 
we “pay” for the extra \y{£) from the y-values of the links incident with the bud b associated with 
£. 

Although this “fixes” the difficulties caused by excluding twin links and buddy links from M, 
it causes new difficulties. Observe that a stem s may no longer have |?/(<5 e(s)) credits, hence, the 
presence of a stem in a semiclosed tree T v could imply that T v lacks enough credits to be good. 
Similarly, a node w £ 1Z may not have ^y(5E(w)) credits, because it may be incident to a link bw 
such that b is a bud and we already used y(bw ) to “pay” for the buddy link of b ; hence, the presence 
of such a node w in a semiclosed tree T v could imply that T v lacks enough credits to be good. See 
Part II for more discussion on these issues. 

Three types of “small configurations” have to be handled in Part II, namely, (i) semiclosed trees 
with three leaves and one stem that are associated with buds and buddy links, (ii) semiclosed trees 
with four leaves, one stem, and one Af-link, and (iii) semiclosed trees with four leaves, two stems, 
and one Af-link. We use different methods to handle (i)-(iii). As we sketched above, we redefine 
Af and the potential function in order to handle (i), and in addition, we have to handle a cascade 
of difficulties caused by buddy links; thus, the whole analysis of Part II gets complicated due to 
the handling of (i). For (ii), either the semiclosed tree is good (so we have sufficient credits to 
contract it), or it is a so-called deficient 4-leaf tree, and we handle it similarly to the handling of 
deficient 3-leaf trees in Part I. (In Part II, we show that a semiclosed tree with at least 5 leaves 
is good, hence, we have only two types of deficient trees, those with three leaves and those with 
four leaves.) For (iii), either the semiclosed tree is good, or it is a so-called bad 2-stem tree, and 
we address the latter using a preprocessing step that contracts all bad 2-stem trees; moreover, we 
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have to modify the potential function to ensure that the preprocessing step has sufficient credits 
for these contractions. 

In Part II, the main loop of the algorithm is similar to that of Part I, except that the simple 
greedy contractions (single-link contractions) of Part I are replaced by (up-to-5) greedy contractions 
(contractions of a set of links of size < 5). 

In summary, Part II differs significantly from Part I, and some of these differences pertain to 
the revised potential function, the preprocessing steps, the greedy contractions, the handling of 
deficient trees, and the handling of buddy links (and side effects). 

10. Conclusions 

In Part I, we proved that the integrality ratio of an SDP relaxation of stemless TAP is < | + e, 
where e > 0 can be any small constant. Our analysis is based on the Lasserre tightening of an 
initial LP. The stemless property is used only in the proof of Theorem 7.3. Part II of the paper goes 
deeper into the techniques employed in Part I and proves the same result relative to the same SDP 
relaxation for (general) TAP. The main result of Part II is a polynomial-time algorithm for finding 
a feasible solution of TAP of size < (| +e)y(E), where y is an optimal solution of Las^LPo), e > 0 
is any small constant, and t = max{17, |~^-~| +1}. 

Our methods rely on the Lasserre system and the decomposition theorem. In fact, we do not 
know how to prove an approximation guarantee of (| + e) for TAP relative to a mathematical 
programming relaxation without using the decomposition theorem (Theorem 4.1). Our critical use 
of the decomposition theorem is in proving the assertion “if a semiclosed tree T[, is not good, then 
T' v (and its incident links) form a deficient 3-leaf tree,” see Theorem 7.3 in Section 7. We proved 
Theorem 7.3 by applying Theorem 4.1 to decompose a fractional solution y of the Lasserre system 
into feasible solutions that are integral over a particular set of links J (local integrality). Note that 
the size of J need not be 0(1), and it is possible that \J\ = D(|P|). Our proof of Theorem 7.3 
makes essential use of the local integrality property on link sets of “unrestricted” size; of course, 
the key to our approach is to show that ones(x) 0 J has size 0(1) for every feasible solution x 
of our initial LP. (Part II uses the decomposition theorem in a similar way.) Other well known 
Lift-and-Project systems such as the Lovasz-Schrijver system and the Sherali-Adams system are 
weaker than the Lasserre system, and, to the best of our knowledge, the local integrality property 
used in our proof of Theorem 7.3 does not hold for 0(1) levels of these Lift-and-Project system. 

We also use the Lasserre system and the decomposition theorem to derive our potential function, 
see Section 5, but this use of the decomposition theorem may be “bypassed” because our potential 
function may be derived using weaker Lift-and-Project systems such as the Lovasz-Schrijver system 
or the Sherali-Adams system. Another way for deriving the potential function is based on formu¬ 
lating a stronger LP relaxation by adding constraints such as the “matching polytope” constraints 
on the leaf-to-leaf links. 
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